Can relativity be considered complete ? 
Prom Newtonian nonlocality to quantum nonlocality and beyond 
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We review the long history of nonlocality in physics with special emphasis on the conceptual 
breakthroughs over the last few years. For the first time it is possible to study "nonlocality without 
signaling" from the outside, that is without all the quantum physics Hilbert space artillery. We 
emphasize that physics has always given a nonlocal description of Nature, except during a short 10 
years gap. We note that the very concept of "nonlocality without signaling" is totally foreign to the 
spirit of relativity, the only strictly local theory. 

PACS numbers: 



I. INTRODUCTION 

100 years after Einstein miraculous year and 70 years 
after the EPR paper |l|, I üke to think that Einstein 
would have appreciated the somewhat provocative title 
of this contribution. However, Einstein would probably 
not have liked its conclusion. But who can doubt that 
relativity is incomplete? and likewise that quantum me- 
chanics is incomplete! Indeed, these are two scientific 
theories and Science is nowhere near its end (as a matter 
of fact, I do believe that there is no end 0)- Well, ac- 
tually, I am, of course, not writing for Einstein, but for 
thosc rcadcrs intcrcsted in a (necessarily somewhat sub- 
jective) account of the peaceful co-existence between 
relativity and quantum physics in the light of the concep- 
tual and experimental progresses that happened during 
the last ten years, set in the broad perspective of physics 
and nonlocality since Newton 0. 



II. NON-LOCALITY ACCORDING TO 
NEWTON 

Isaac Newton, the great Newton of Universal Gravita- 
tion, was not entirely happy with his theory. Indeed, he 
was well aware of an awkward consequence of his theory: 
if a stone is moved on the moon, then our weight, of all 
of us, here on earth, is immediately modified. What trou- 
bled so much Newton was this immediate effect, i.e. the 
nonlocal prediction of his theory. Let's read how Newton 
described it himself 

That Gravity should be innate, inherent and essential 
to Matter, so that one Body may act upon another at a 
Distance thro a Vacuum, without the mediation of any 
thing else, by and through which their Action and Force 
may be conveyed from one to another, is to me so great an 
Absurdity, that I believe no Man who has in philosophi- 
cal Matters a competent Faculty of thinking, can ever fall 
into it. Gravity must be caused by an Agent acting con- 
stantly according to certain Laws, but whether this Agent 
be material or immaterial, I have left to the Considera- 
tion of my Readers. 

It would have been hard for Newton to be more explicit 



in his rejection of nonlocality! However, most physicists 
didn't pay much attention to this aspect of Newtonian 
physics. By lack of alternative, physics remained nonlo- 
cal until about 1915 when Einstein introduced the world 
to General Relativity. But let's start ten years earlier, in 
1905. 



III. EINSTEIN, THE GREATEST 
MECHANICAL ENGINEER 

In 1905 Einstein introduced three radically new the- 
ories or models in physics. Special relativity of course, 
but more relevant to this section are his descriptions of 
Brownian motion and of the photo-electric effect. Indeed, 
both descriptions show Einstein's deep intuition about 
mechanics. Brownian motion is explained as a com- 
plex series of billiard-ball-like-collisions between a visi- 
ble molecule - the particle undergoing Brownian motion 
- and invisible smaller partides. The random collisions 
of the latter explaining the erràtic motion of the former. 
Likewise, the photo-electric effect is given a mechanis- 
tic explanation. Light beams contain little billiard-balls 
whose energy depends on the color, i.e. wavelength, of 
the light. These light-billiard-balls (today called photons 
and recognized as not at all billiard-ball-like) hit the elec- 
trons on metallic surfaces and mechanically kick them out 
of the metal, provided they have enough energy. 

General relativity can also be seen as a mechanical 
description of gravitation. When a stone is moved on the 
moon, a bunch of gravitons (in modern terminology) fly 
off in all directions at a finite speed, the speed of light. 
Hence, about a second later, the earth is informed and 
only then is our weight affected. This is, I believe, the 
greatest achievement of Einstein, the greatest mechanical 
engineer 1 of all times: Einstein turned physics into 



1 My friends know well that in my mouth "engineer" has no nega- 
tive connotation, quite the opposite. For me, a physicist must be 
a good theorist and a good engineer! Well, I warned you, dear 
reader, this is a somewhat subjective article. 
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a local theory! 



IV. QUANTUM MECHANICS IS NOT 
MECHANICAL 

Only about ten years after general relativity came 
quantum mechanics. This was quite an extraordinary 
revolution. Until then, greatly thanks to Newton and 
Einstein's genius, Nature was seen as made out of 
many littlc billiard-balls that mechanically bang into each 
other. Yet, quantum mechanics is characterized by the 
very fact that it no longer gives a mechanical description 
of Nature. The terminology quantum mechanics is just a 
historical mistake, it should be called Quantum Physics 
as it is a radically new sort of physical description of 
Nature. 

But this new description let nonlocality back into 
Physics! And this was unacceptable for Einstcin. 

It is remarkable and little noticed that since Newton, 
physics gave a local description of Nature only during 
some 10 years, between about 1915 and 1925. All the 
rest of the time, it was nonlocal, though, with quan- 
tum physics, in quite a different sense as with Newton 
gravitation. Indeed, the latter implies the possibility of 
arbitrarily fast signaling, while the former prohibits it. 



V. NON-LOCALITY ACCORDING TO 
EINSTEIN 

In 1935 two celebrated papers appeared in respectable 
journals, both with famous authors, both stressing the - 
unacceptable in their authors view - nonlocal prediction 
of quantum physics Q, Q . A lot has been written on the 
EPR "paradox" and I won't add to this. I believe that 
Einstein's reaction is easy to understand. Hcrc is the man 
who turned physics local, centúries after Newton wrote 
his alarming text, he is proud of his achievement and cer- 
tainly deserves to be. Now, only a few years latter, non- 
locality reappears! Today one should add that quantum 
nonlocality is quite a different concept from Newtonian 
nonlocality, but Einstein did not fully realize this. 

What Einstein and his colleagues saw is that quan- 
tum physics describes spatially separated partides as one 
global system in which the two partides are not logically 
separated. What they did not fully realize is that this 
does not allow for signaling, hence it is not in direct con- 
fiict with relativity. In the next section l'li try to present 
this using modern terminology. 

Most physicists didn't pay much attention to this as- 
pect of quantum physics. A kind of consensus established 
that this was to be left for future examination, once the 
technology would be more advanced. The general feeling 
was that quantum nonlocality was nothing but a labora- 
tory curiosity, not serious physics. 

Young physicists may have a hard time to believe that 
such an important concept, like quantum nonlocality, 



was, during many dècades, not considered as serious. But 
this was indeed the real state of affairs: ask any older 
professors, a vast majority of them still believes that it 
is unimportant. Let me add two little stories that illus- 
trate what the situation was like. John Bell, the famous 
John Bell of the Bell inequalities and of the Bell states, 
never had any quantum physics student. When a young 
physicist would approach him and talk about nonlocal- 
ity, John's first question was: "Do you have a permanent 
position?" . Indeed, without such a permanent position it 
was unwise to dare talking about nonlocality! Notice that 
John Bell almost never publishcd any of his remarkable 
and nowadays famous papers Q in serious journals: the 
battle with referees were too ... time wasting (not to use a 
more direct terminology). Further, if you went to CERN 
where John Bell held a permanent position in the theory 
department and asked at random about John's contribu- 
tions to physics, his work on the foundation of quantum 
physics would barely be mentioned (true enough, he had 
so many other great contributions!) 2 . 

Anyway, so quantum nonlocality remained for dècades 
in the curiosity lab and no one paid much attention. But 
in the 1990's two things changed. First, a conceptual 
breakthrough happened thanks to Artur Ekert and to 
his adviser David Deutsch 0- They showed that quan- 
tum nonlocality could be exploited to establish a cryp- 
tographic key between two distant partners and that the 
confidentiality of the key could be tested by means of 
Bell's inequality. What a revolution! This is the first 
time that someone suggested that quantum nonlocality 
is not only real, but that it could even be of some use. 
A second contribution came from the progress in tech- 
nology. Optical fibers had been developed and installed 
all over the world. And Mandcl's group at the Univer- 
sity of Rochester (where I held a one-year post-doc posi- 
tion and first met with òptics) applied parametric down- 
conversion to produce entangled photon pairs 01 ■ This 
was enough (up to the detectors) to demonstrate quan- 
tum nonlocality outside the curiosity laboratory. In 1997 
my group at Geneva University demonstrated the vio- 
lation of Bell inequalities between two villages around 
Geneva, see Fig. 1, separated by a little more than 10 
km and linked by a 15km long Standard telecom fibcr 
[ïïl (since then, we have achieved 50km 0|). So 
quantum nonlocality became politically acceptable! But 
what is it? Let me introduce the concept using students 
undergoing " quantum exams" . 



2 Anothcr story happened to me while I was a young post-doc eager 
to publish some work. In a paper [j| I wrote " A quantum particle 
may disappear from a location A and simultaneously reappear in 
B, without any flow in-between". The referee accepted the paper 
under the condition that this outrageous sentence is removed. 
This referee considered his paternalist attitude so constructive 
that he declared himself to me: "look how helpful I am to you" 
(admittedly, he was politically correct). 
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VI. QUANTUM EXAMS: ENTANGLEMENT 

Assume that two students, Alice and Bob, have to pass 
some exams. As always for exams, the situation is ar- 
ranged in such a way that the students can't communi- 
catc during the exam. Clcarly however, they are allowed, 
and even encouraged, to communicate bcforehand. Alice 
and Bob know in advance the list of possible qüestions, 
they also know that this is a kind of exam allowing only 
a very limited number of possible answers, often only a 
binary choice between yes and no. During the exam Al- 
ice receives one question out of the list, let's denote it by 
x; Bob receives question y. Finally, denote a and b Alice 
and Bob's answers, respectively. Hence, an exam is a re- 
alization of a random process described by a conditional 
probability function, often merely called a correlation: 

P(a,b\x,y) (1) 

Clearly, the choice of qüestions x and y are under the 
professor's control. However, as all professors know, the 
students' answers a and b are not! This is similar to ex- 
periments: the choice as to which experiment to perform 
is under the physicists control, but not the answer given 
by Nature. 

In the following, we shall consider three kinds of exams, 
in order to understand what kind of constraints they set 
on the correlation P(a, b\x, y). 

A. Quantum exam #1 

In this first kind of quantum exam Alice is asked to 
teli which question is given to Bob, and vice-versa. This 
is clearly an unfair exam! Why? Because Alice and Bob 
are not supposed to communicate. How could they then 
succeed with a probability greater than mere chance 3 ? 
This simple example shows that prohibiting signaling al- 
ready limits the set of possible correlation P(a, b\x,y). 
For example P(a, b\x,y) = S(a = y)S(b = x) is excluded. 
Notice that a correlation P(a,b\x,y) is non-signaling if 
and only if its marginal probabilities are independent of 
the other side input: P{a,b\x,y) is independent of y 
and J2 a -P( a ; b\ x i y) is independent of x. 

B. Quantum exam #2 

The second kind of quantum exam is closer to Standard 
exams. Alice and Bob are simply requested to provide 
the same answer whenever they receive the same ques- 
tion. This is clearly feasible: we all expect that good 
students give the same answer to the same question. It 



sufnces that they prepare for the exam well enough. Note 
that the quantum exam #2 under consideration here is 
even easier than Standard exams, as there is no notion of 
correct or incorrect answers. All that is required is that 
Alice and Bob give consistent answers: it sufnces that 
they jointly decide in advance which answer to give for 
each of the possible qüestions. 

Now, a central problem: Could Alice and Bob suc- 
ceed with certainty for such an exam #2 by other means, 
that is without jointly deciding the answers in advance? 
Think about it. If you found an alternative trick, then, if 
you are a student, you should use your trick to pass the 
next examination: just apply your trick together with the 
best student, you'll get the same mark as him/her 4 . And 
if you are a professor and found a trick, then you should 
stop testing your student with Standard exams! Well, of 
course, there is no other trick, at least none applicable to 
classical students. 

Correlations that satisfy P(a = b\x = y) = 1 are nec- 
essarily of the form 

P(a, b\x, y) = Y, PWQ(o\x, X)Q(b\y, A) (2) 

A 

for some probability function Q and some distribution p 
of common strategy À. Historically, the A were called 
" local hidden variables" , computer scientist call them 
" shared randomness" ; here the A denote common strate- 
gies. 

P(a = b\x = y) = 1 is but one example of a local 
correlation, among infinitely many others. Rclation @ 
characterizes all local correlations. 

In summary, some exams require common strategies; 
in other words, some observed correlations can't be ex- 
plained except by common causes. 



C. Quantum exam #3 

The third kind of quantum exam is the most tricky 
and interesting. For (apparent) simplicity let's restrict 
the set of qüestions and answers to binary sets and let 
us label them by bits, "0" and "1". In this exam Alice 
and Bob are required to always output the same answer, 
except when they both receive the question labelled " 1" 
in which case they should output different answers. Note 
that formally this exam requires that Alice and Bob's 
data satisfy the following equality, modulo 2: a + b = x-y. 
This time it is not immediately obvious whether they can 
prepare a strategy that guaranties success. 

Assume first that the strategy forces Alice to output 
an answer that depends only on her input x, i.e. Alice's 
strategy is deterministic. But in such a case, whenever 



3 Somewhat surprisingly there is a strategy such that the proba- 
bility that both players succeed is 50%. 



4 Admittedly, the danger is that both student that get the bad 
mark! But, on average, the poor student improves. 
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Bob receives the question 1, he can't decide on his out- 
put since it should depend on Alice's question. Next, 
if Alice's output is random, this is clearly of no help to 
Bob. Consequently there is no way for Alice and Bob to 
succeed with certainty. 

Let us emphasize that successfully completing this 
exam does not necessarily imply communication between 
Alice and Bob. Indeed, assume that somehow Alice and 
Bob's data would always satisfy a + b = x ■ y. Would 
this allow Alice to communicate to Bob, or vice-versa? 
Well, it depends! If Alice's output a is known to Bob, 
for instance they decide on a = always, thcn whcnever 
Bob receives y = 1, he can deduce Alice's question from 
a + b — x ■ y and from his output: x = b in the example. 
But if Alice's outcome is unknown to Bob, for instance if 
Alice outcome is merely a random bit, then the relation 
a + b = x ■ y is of no help to Bob. We shall come back 
to this concept of a non-signaling corrclation satisfying 
a + b = x ■ y in section lX Cl 

Let us define the mark M of this quantum exam #3 
as the sum of the success probabilities 01 : 
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It is not difficult to realize that the optimal strategy for 
Alice and Bob consists in deciding in advance on a com- 
mon answer, independent of the qüestions they receive. 
With such a strategy they are able to achieve the mark 
M = 3. This is indeed the optimal mark achievable by 
commou strategies: 

M < 3 (4) 

This is an example of a Bell inequality: a constraint on 
correlations arising from common strategies. Interest- 
ing Bell inequalities are those that can be violated by 
quantum physics. In the case of Q), if Alice and Bob 
share singlets, then they can obtain the mark Mqp = 
2 + y2 w 3.41. Tsirelson proved that this is the highest 
mark achievable using quantum correlations |Ï5|. 

Accordingly, quantum theory predicts that some tasks 
can be achieved that can't be predicted by any local me- 
chanical model, i.e. some exams are passed with higher 
marks than classically possible. The fact that such tasks 
were invented for the purpose of showing the superiority 
of quantum physics doesn't affect the conclusion. Still, 
it is only once some useful and natural tasks were found, 
concretizing the superior power of quantum physics over 
all possible local strategies, that quantum nonlocality be- 
came accepted by the physics community 5 . 



I wish someone establishes the statistics of the occurrences of 
the words "Bell inequality" and "nonlocality" in Physical Re- 




FIG. 1: Bernex and Bellevue are the two villages north and 
south of Geneva between which our long-distance test of Bell 
inequality outside the lab was performed in 1997, section IV1 
The inset represent two player that toss coins, as explained in 
section lVllI In the real experiment the coins were replaced by 
photons, the players by interferometers, their right and left 
hands by phase modulators and head/tail by two detectors. 
The experimental results are similar to that of the game, with 
weaker but still nonlocal correlations. 



VII. COIN TOSSING AT A DISTANCE 

Another way to present nonlocality to non-physicist 
friends is the following. Imagine two hypothetical players 
that toss coins. The players are separated in space and 
toss their coin once per minute. They use their free-will 
to decide, for each toss, whether to use their right hand 
or their left hand, independently of each other. And they 
mark all results (time, hand and head/tail) in a big black 
laboratory notebook, see Fig. 1. 

After thousands of tosses, they get bored. Especially, 
given that nothing interesting happens: for each of the 
two players, heads and tails occur with a frequency of 
50%, independently of which hand they use. Hence, the 
players decide to go for a beer. There, in the bar, they 
compare their notes and get very excited. Indeed, quickly 
they notice that whenever at least one of thcm happened 
to have chosen his right hand for tossing his coin, both 
players always obtained the same result: cither both head 
or both tails. But whenever, by mere chance, they both 
chose the left hand, then they always obtained opposite 
results: head/tail or tail/head. A very remarkable corre- 
lation! 

The observation of correlations and the development 



view Letters. I bet that a phase transition happen in the early 
1990's, after Ekert's paper on quantum cryptography. In 1997 I 
started a PRL with the sentence: "Quantum theory is nonlocal." 
and got considerable reactions to what was felt as a provocative 
statement; today the same statement can be found in many pa- 
pers, not provoking any reaction. 
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of theoretical models explaining them is the essence of 
the scientific method. This is true not only in physics, 
but also in all other sciences, like geology and medicine 
for instance. John Bell used to say " correlations cry out 
for explanations!" |16|. 

So, why are our two players that excited by the correla- 
tion they observe? Note that locally, nothing interesting 
happens; in particular there is no way for one player to 
infer from his data which hand the other player chose. 
Even if one player decides to always use the same hand, 
this has no effect on the statistics observed by his col- 
league. Consequently, this game and the observed cor- 
relation do not imply any signaling. So, why do we feel 
that this is impossible? Actually, frankly, I do not know! 

Classical correlations are always explained by either of 
two kinds of causes. The first kind is "signaling", one 
player somehow informs or influences the other player. 
This is clearly not the case here, since we assumed the 
players were widely separated in space (for the physicists 
we may add " space-like separated" ) . The second kind of 
causes for classical correlations is a common cause. For 
example all football players simultaneously stop running, 
because the umpire whistled. This kind of cause is pre- 
cisely equivalent to the assumption of a common strat- 
egy, as formalized by @ and excluded for the present 
correlation by Bell's inequality (QJ. Consequently, the 
correlation observed by our two players is of a different 
nature. The big surprise is that anything beyond the two 
classical causes for correlation exists! This is what Ein- 
stein and many others had a hard time to believe. But, 
today, if one accepts this as a matter of theoretical pre- 
diction and experimental confirmation, then the next big 
question is "why can't the correlation observed by our 
hypothctical players not be observed in the real world?" . 
Indeed, quantum physics (and tensor products of Hilbcrt 
spaces) teli us that Bell's inequality (QJ can be violated, 
i.e. not all quantum correlations can be explained by one 
of the two kinds of classical causes for correlations, but 
quantum physics does not allow correlations as strong as 
observed by our hypothetical players. Still, this game is 
illustrative of quantum nonlocality, as we shall elaborate 
in section Ixi 



VIII. EXPERIMENTS: GOD DOES PLAY DICE, 
HE EVEN PLAYS WITH NONLOCAL DICE 

Physics is an experimental science and experiments 
have again and again supported the nonlocal predic- 
tions of quantum theory. All kind of experiments 
have been performed, in laboratories ^3 and outside 
[ïïl lï^ . lï^l . with photons and with massive partides [20| . 
with independent observers to close the locality loophole 
GUIU El E E3> with quasi-perfect detectors ^ to 
close the detection loophole, with high precision timing 
to bound the speed of hypothetical hidden communica- 
tio n l22l , with moving observers to test alternative mod- 
els |23| ímulti-simultaneitv |24| and Bohm's pilot wave 



[2^| ) 6 . All these results proclaim loudly: God plays 
dice. Note how irònic the situation is: the conclusion 
"God plays dice" is imposed on us by the experimental 
evidence supporting quantum nonlocality and by Ein- 
stein's postulate that no information can travel faster 
than light. Indeed, as mentioned in sub-section I VI Cl a 
violation of |0J with deterministic outputs leads to sig- 
naling. Consequently, the experimental violation oi_J@ 
and the no-signaling principle imply randomness |26ll27| . 

Actually, the situation is even more interesting: Not 
only does God play dice, but he plays with nonlocal dice! 
The same randomness manifests itself at several 
locations, approximating a + b w x ■ y better than pos- 
sible with any local classical physics model. 

A very small minority of physicists still refuse to ac- 
cept quantum nonlocality. They ask (sometimes with 
anger) How can these two space-time locations, out there, 
know about what happens in each other without any sort 
of communication ?. I believe that this is an excellent 
question! I have slept with it for years [28(. I summarize 
my conclusion in the next section. 

IX. ENTANGLEMENT AS A CAUSE OF 
CORRELATION 

Quantum physics predicts the existence of a totally 
new kind of correlation that will never have any kind of 
mechanical explanation. And experiments confirm this: 
Nature is able to produce the same randomness at several 
locations, possibly space-like separated. The Standard 
explanation is " entanglement" , but this is just a word, 
with a precise technical definition [2^ . I3(Í| . Still words 
are useful to name objects and concepts. However, it re- 
mains to understand the concept. Entanglement is a new 
explanation for correlations. Quantum correlations sim- 
ply happen, as other things happen (fire burns, hitting a 
wall hurts, etc). Entanglement appears at the same con- 
ceptual level as local causes and effects. It is a primitive 
concept, not reducible to local causes and effects. Entan- 
glement describes correlations without correlata [3Ï| in a 
holistic view [3Ï| • In other worlds, a quantum corre- 
lation is not a correlation between 2 events, but 
a single event that manifests itself at 2 locations. 

Are you satisfied with my explanation of what entan- 
glement is? Well, I am not entirely! But what is clear 
is that entanglement exists. Moreover, entanglement is 
incredible robust! The last point might come as a sur- 
prise, since it is still oftcn claimcd that entanglement is 
as elusive as a dream: as soon as you try to talk about 
it, it evaporates! Historically this was part of the suspi- 
cion that entanglement was not really real, nothing more 



The conclusion that follows from all these experiments is so im- 
portant for the physicist's world-view, that an experiment clos- 
ing simultaneously both the locality and the detection loophole 
is greatly needed. 
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than some exòtic partides that live for merely a tiny 
fraction of a second. But today we see a growing num- 
ber of remarkable experiments mastering en tanglem ent. 
Entanglement over long distances ITïl [Tj 19, 331, en- 
tanglement between many photons |34j and many ions 
, entanglement of an ion and a photon j^E > en- 
tanglement of mesoscopic systems (more precisely entan- 
glement between a few collective modes carried by many 
partides) [Ülllli^, entanglement swapping [HEllZif . 
the transfer of entanglement between different carriers 
0, etc. 

In summary: entanglement exists and is going to affect 
future technology. It is a radically new concept, requiring 
new words and a new conceptual category. 



X. FROM QUANTUM NONLOCALITY TO 
MERE NONLOCALITY 

So far we have seen that quantum physics produces 
nonlocal correlations. And so what? Ok, this can be used 
for Quantum Key Distribution and other Quantum Infor- 
mation processes, but that doesn't help much to under- 
stand non-locality. Conceptually, one would like to study 
non-locality without all the quantum physics infrastruc- 
ture: Hilbert spaces, observables and tensor products. 
Not too surprisingly, once the existence of non-locality 
was accepted, the conceptual tools to study it came very 
naturally. Actually, the tools were already there, in the 
mathematics ^BJ and even the physics j2|| [2^ litera- 
ture, waiting for a community to wake up! The bàsic 
tool is simple, doesn't require any knowledge of quan- 
tum physics and allows one, so to say, to study quan- 
tum nonlocality "from the outside" , i.e. from outside the 
quantum physics infrastructure. 

Let us go back to the quantum exam #3 (subsection 
IV! Cf) . Assume that Alice and Bob are not restricted 
by quantum physics, but only restricted by no-signaling. 
Consequently, they would fail the quantum exam #1. 
But under this mild no-signaling condition they could 
perfectly succeed in the quantum exam #3: Alice and 
Bob would each output a bit which locally looks perfectly 
random and independent from their inputs - hence there 
would be no signaling - yet their 2 bits would satisfy 
a + b = x ■ y, exactly as in the coin tossing game of 
section lYlII The hypothetical "machine" that produces 
precisely this correlation is a bàsic example of the kind 
of conceptual tools we need to study nonlocality without 
quantum physics. Formally, the correlation function is 
defined by: 

P{a,b\x,y) = -S(a + b = x-y) (5) 

where the 5(z\ = Z2) function takes value 1 for z\ = Z2 
and value otherwise. 

The correlation (J5J is often referred to as a PR-box, to 
recali the seminal work by Popescu and Rohrlich [26ll27j| . 



or as a NL-machine (a Non-Local machine 7 ). The idea 
of these terminologies is to emphasize the similarities be- 
tween quantum measurements on 2 maximally entangled 
qubits and the correlation JSJ: in both cases the outcome 
is available as soon as the corresponding input is given 
(Alice knows a as soon as she inputs x into her part of the 
machine and similarly Bob knows b as soon as he inputs 
y, there is no need to wait for the other's input) and in 
both the quantum and the PR-box cases the "machine'' 
can't be used more than once (once Alice has input x, 
she can't change her mind and give another input). No- 
tice a third nice analogy, neither the quantum nor the 
NL machines allow for signaling. Indeed, in all cases the 
marginals are pure noise, independently of any input. 

Note that quantum physics is unable to produce the 
PR correlation J5J. Indeed, this correlation violates the 
Bell inequality (@J up to its algebraic maximum, M = 4, 
while Tsirelson's theorem states that quantum cor- 
relations are restricted to M < 2 + y/2. However, the 
correlation JSJ is much simpler than quantum correla- 
tions, while sharing many of their essential features. In 
particular JSJ) is nonlocal but non-signaling. 

In order to get some deeper understanding of the power 
of this hypothetical machine (J5J as a conceptual tool, let 
us consider 3 properties of quantum correlations (many 
further nice aspects can be found in 0iEEE3D- First 
we shall consider the so-called quantum no-cloning theo- 
rem and see that it is actually not a quantum theorem, 
but a no-signaling theorem. The next natural step is 
to analyze quantum cryptography, whose security is of- 
ten said to be based on the no-cloning theorem, and as 
we would expect by now, we shall find "non-signaling 
cryptography" . Finally, we consider the question of the 
communication cost to simulate maximal quantum cor- 
relation. But before all this we need to recali some facts 
about non-signaling correlations. 



A. The set of non-signaling correlations 

Let us consider the set of all possible bi-partite correla- 
tions P(a, b\x, y), where the inputs are taken from finite 
alphabets {x} and {y} and similarly for the outputs {et} 
and {&}, and which are non-signaling: 

y^P(a, b\x, y) = P(a\x) is independent of y (6) 

b 



P(a, b\x, y) — P(b\y) is independent of x (7) 

a 



A "machine" is a physicists's terminology for an input-output 
black-box that is not necessarily mechanical. I believe that this 
terminology appeared in the quantum physics context with the 
" optimal cloning machines" introduced by Buzek and Hillery |4Í_I 
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FIG. 2: Geometrical view of the set of correlations. The 
bottom part represents the convex set (polytope) of local 
correlations, with the upper facet corresponding to the Bell 
inequality (|1J. The upper triangle corresponds to the non- 
local non-signaling correlations that violate the Bell inequal- 
ity. The smooth thin curve limits the correlations achievable 
by quantum physics. The top of the triangle corresponds to 
the unique non-signaling vèrtex above this Bell inequality, i.e. 
to the non- local PR machine (JüJ. The thin vertical line 
represents the isotropic correlations © with the indication 
of some of the vàlues of pnl- 



A priori this set looks huge. But it has a nice struc- 
ture. First, it is a convex set: convex combinations of 
non-signaling correlations are still non-signaling. Sec- 
ond, there are only a finite number of extremal points 
(mathematician call such sets polytopes and the extremal 
point vertices); accordingly every non-signaling correla- 
tion can be decomposed into a (not necessarily unique) 
convex combination of extremal points. This is analog 
to quantum mixed states that can be decomposed into 
convex mixtures of pure states. 

Among the non-signaling correlations are the local 
ones, i.e. those of the form @, analog to separable 
quantum states. The set of local correlations also forms a 
polytope, a sub-polytope of the non-signaling one. More- 
over all vertices of the local polytope are also vertices of 
the non-signaling polytope, see Fig. 2 [4^. The facets of 
the local polytope are in one-to-one correspondence with 
all tight Bell inequality. 

Let us illustrate this for the simple binary case (which 
is in any case the only one we need in this article), i.e. 
a, b, x, y are 4 bits. In this case, it is known that there 
are only 8 non-trivial Bell inequalitics (i.e. not counting 
the trivial inequalities of the form P(a,b\x,y) > 0), i.e. 
only 8 relevant facets of the local polytope. Interestingly, 
Barret and co-workers [Ü| demonstrated that the non- 
signaling polytope has only 8 vertices more than the local 
polytope, exactly one per Bell inequality! Each of these 8 
vertices is equivalent to the PR correlation JSJ , up to an 



elementary symmetry (flip an input and/or an output). 
Although these polytopes live in an 8-dimensional space 8 , 
their essential properties can be recalled from the simple 
geometry of figure 2. 



B. No-cloning theorem 

Details can be found in ^t|j as nere we would merely 
like to present the intuition. Let us assume that Alice 
(input and output bits x and a, respectively) shares the 
correlation © both with Bob (bits y and b) and with 
Charly (bits z and c): a + b = xy and a + c = xz. Note 
that this situation is different from the case where Alice 
would share one "machine" with Bob and share another 
independent "machine" with Charly: in the situation un- 
der investigation Alice holds a single input bit x and a 
single output bit a. We shall see that the assumption 
that Alice's input and output bits x and a are correlated 
both to Bob and to Charly leads to signaling. Hence in 
a Universe without signaling, Alice can't share the cor- 
relation JSJ with more than one partner: the correlation 
can't be cloned. 

In order to understand this, assume that Bob and 
Charly come together, input y = 1 and z — 0, and add 
their output bits b + c. According to the assumed corre- 
lations and using the modulo 2 arithmetic a + a = 0, one 
gets: b + c = a + b + a + c = xy + xz = x. Hence, they 
could determine from their data that Alice's input bit is 
x, i.e. Alice could signal to them! 

A natural question is how noisy should the correlation 
(0 be to allow cloning? The answer is interesting: as long 
as the Alice-Bob correlation violates the Bell inequality 
the Alice-Charly correlation can't violate it; if not 
there is signaling. 

We have just seen that the CHSH-Bell inequality Q 
is monogamous, like well kept secrets. Let's now see that 
this is not a coincidence! 



C. Non-signaling cryptography 

In 1991 Artur Ekert's discovery of quantum cryptogra- 
phy [9j based on the violation of Bell's inequality changed 
the (physicist's) world: entanglement and quantum non- 
locality became respectable. Now, as we shall see in this 
subsection, the essence of the security of quantum cryp- 
tography does not come from the Hilbert space structure 
of quantum physics (i.e. not from entanglement), but is 
due to no-signaling nonlocal correlation! The fact that 
quantum physics offers a way to realize such correlation 
makes the idea practical. However, if one would find any 



8 More precis ely, 8 is the dimension of the space of non-signaling 
correlations 15(1. 
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other way to establish such no-signaling nonlocal corre- 
lacions (a way totally unknown today), then this would 
equally well serve as a mean to establish cryptographic 
keys lEU- 

Let us emphasize that the goal is to assume no restric- 
tion on the adversary's power, i.e. on Eve, except no- 
signaling 9 [s^[. Obviously, if one assumes additional re- 
strictions on Eve, like restricting her to quantum physics, 
then Alice and Bob can distill more secret bits from their 
data. But qualitatively, the situation would remain un- 
changed. 

Assume that two partners, Alice and Bob, hold devices 
that allow thcm to each input a bit (make a binary choicc 
of what to do, e.g. which experiment to perform) and 
each receives an output bit (e.g. a measurement result). 
This can be cast into the form of an arbitrary correla- 
tion: P(a, b\x,y), with a,b,x,y four bits. Assume fur- 
thermore that the devices held by Alice and Bob do not 
allow signaling. This simple and very natural assump- 
tion suffices to give a nice structure to the set of corre- 
lations P(a, b\x, y); as we recali in subsection IX Al this 
set is convex and has a finite numbcr of extreme points, 
called vertices. The nice property is that any correlation 
P(a, b\x,y) can be decomposed into a convex combina- 
tion of vertices, hence once one knows the vertices one 
knows essentially everything. If the correlation is local, 
i.e. of the form then it is not useful for cryptogra- 
phy; indeed the adversary Eve may know the strategy 
A. Hence, let's assume that P(a,b\x,y) violates the Bell 
inequality @. Consequently P(a,b\x,y) lies in a well 
defined còrner of the general polytope, a sub-polytope. 
Barrett and co-workers found that this sub-polytope has 
only 9 vertices ^8|, 8 local ones for which M = 3 and 
only one nonlocal vèrtex, that corresponding to our con- 
ceptual tool, i.e. to a + b = xy, for which M = 4, see 
Fig. 2. 

In the case that Alice and Bob are maximally corre- 
lated (maximally but non-signaling!), i.e. their correla- 
tion correspond to the nonlocal vèrtex of Fig. 2, it is 
intuitively clear that the adversary Eve can't be corre- 
lated neither to Alice, nor to Bob, by the no-cloning ar- 
gument sketched in the previous subsection. Hence, in 
such a case Alice and Bob receive from their apparatuses 
perfectly secret bits. However, these bits are not always 
correlated: when x = y = 1 they are anti-correlated. But 
this can be easily fixed by the following protocol. After 
Alice and Bob received their output bits, Alice announces 
publicly her input bit x and Bob changes his output bit to 
b 1 = b + xy. Now Alice and Bob are perfectly correlated 
and Eve still knows nothing about a and b'. 

Consider now that Alice and Bob are non maximally 



No-signaling should be understood here as in the previous sub- 
section on the no-cloning theorem. That is, even if two parties 
joint, for example Eve and Bob come together, then they should 
not be able to infer any information about the third party's input, 
e.g. Eve and Bob should not have access no Alice's input. 



correlated: 

P(a i b\x,y) = ~S(a + b= x-y) + (8) 

For pnl > this correlation violates the inequality 
for pnl < v2— 1, it can be realized by quantum physics. 
Can Alice and Bob exploit such a correlation for crypto- 
graphic usage secure against an arbitrary adversary who 
is not restricted by quantum physics, but only restricted 
by the no-signaling physics? The full answer to this fas- 
cinating question is still unknown. However, there is an 
optimistic answer if one assumes that Eve attacks each 
realization independently of the others, the so-called in- 
dividual attacks. In such a case, one may assume that 
Eve does actually distribute the apparatuses to Alice and 
Bob. Some apparatuses are ordinary local ones, for these 
Eve knows exactly the relation between the input and 
output bits, both for Alice and for Bob. For example, 
Eve sends to Alice an apparatus that always outputs a 
0, and to Bob an apparatus that outputs the input bit: 
b = y. In this example Eve knows Alice's bit a, but 
doesn't know Bob's bit. For some local pairs of appara- 
tuses Eve knows both a and ò, or b but not a. But, if 
the Alice-Bob correlation (JSJ violates the Bell inequality 
@, i.e. if pnl > 0, then Eve must sometimes send to 
Alice and Bob the apparatuses that produce the max- 
imal nonlocal correlation a + b = xy 10 , in which case 
she knows nothing about Alice and Bob's output bits a 
and b. A detailed analysis can be found in |5j|. Here 
we merely recali the result. For pnl > 0.318 the Shan- 
non mutual information between Alice and Bob is larger 
than the Eve-Bob mutual information [5]]]. Hence for 
Pnl > 0.318 Alice and Bob can distil a cryptographic 
secret key out of their data, secure even against an hypo- 
thetical post-quantum adversary, provided this adversary 
is still subject to no-signaling. 

Actually, in [Hï| we worked out a 2-way protocol for 
key distillation vàlid down to pnl > 0.09. There, it is 
also proven that the intrinsic information is positive for 
all positive pnl- It is thus tempting to conjecture that 
secret key distillation is possible if and only if the Bell 
inequality is violated 11 . 



One may think that Eve should sometimes send a weakly non- 
local machine. But all such correlations are convex combinations 
of local and fully non-local NL-machines. Hence, it is equivalent 
for Eve to always send either a local or a NL-machine, with 
ap prop riate probabilities. 

In |47j we proved that a correlation P(a, b\x, y) is nonlocal iff 
any possible non-signaling extensions P(a, 6, e\x,y, z) has pos- 
itive Alice-Bob condition mutual information, conditioned on 
Eve, I(A, B\E), i.e. has positive intrinsic information. This 
nicely complements the similar re sult that hol ds for entangled 
quantum states and purifications l53l . In l5ll we proved that 
the same relation between nonlocality and positive intrinsic in- 
formation does also hold when Alice announces her input and 
Bob adapts his output in such a way as to maximize his mutual 
information with Alice. Proving this in full generality would be 
a marvcllous result. 
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Another beautiful result is the observation of an Infor- 
mation gain versus disturbance relationship, very similar 
to that of quantum physics, based on Heisenberg's un- 
certainty relations |5J|. Let us analyze separately the 
cases where Alice announces x — and x = 1, and de- 
note the respective Alice-Bob error rates QBER X and the 
Eve-Bob mutual informations I X (B,E), i.e. QBER X = 
E y P(a^b)\x,y) and I X (B,E) = H(B\x) - H(B\E,x). 
Remarkably, Io(B,E) is a function of only QBER\ and 
Ii(B,E) of QBERq 12 : information gain for one input 
necessarily produces errors for the other input, in anal- 
ogy with the quantum case where information gain on 
on basis necessarily perturbs information encoded in a 
conjugated basis! 



To conclude this subsection, let us emphasize that the 
distribution of the correlation JSJ by quantum means re- 
quires a protocol that differs from the famous BB84 pro- 
tocol [55j. Indeed, the data obtained by Alice and Bob 
following the BB84 protocol do not violate any Bell in- 
equality, hence the BB84 protocol is not secure against 
a non-signaling post-quantum adversary. Indeed, even 
the noise-free BB84 data can be obtained from quantum 
measurements on a separable state in higher dimension. 
The additional dimension could, for the example of po- 
larization coding, be side-channcls due to accidental ad- 
ditional wavelength coding. Consequently, Standard se- 
curity proofs |56l l57| must make assumptions about the 
dimension of the relevant Hilbert spaces (accordingly, no 
security proof of quantum key distribution is uncondi- 
tional, contrary to widespread claims). But it is easy to 
adapt the BB84 protocol, it suffices that Alice measures 
the physical quantities corresponding to the Pauli ma- 
trices o~ z or a Xl depending on her input bit value or 
1, respectively, exactly as in BB84, and Bob measures 
in the diagonal bases: o~+m° and (7-450 for y = and 
y = 1, respectively. In this way Alice and Bob's data 
are never perfectly correlated, but they can violate the 
Bell inequality and be thus exploited to distil a secret key 
vàlid even against post-quantum adversàries. Note that 
the violation of a Bell inequality guarantees that no side 
channcls accidentally leak out information. Furthermore, 
in this protocol, that we like to call the CHSH-protocol, 
in honor of the 4 inventors 

El 

of the most useful Bell 
inequality (actually equivalent to I0J), Alice announces 
her input bit x, i.e. her basis as in BB84, but Bob doesn't 
speak, he always accepts and merely flips his bit in case 
x = y = 1. In summary, in the CHSH protocol Alice and 
Bob use all the raw bits, however their data are initially 
noisier than in the BB84 protocol. 



Precisely one has: I (B,E) = 2 ■ QBERi and h{B,E) = 2 ■ 
QBERq. 



D. Cost of simulating quantum correlations 

Among the many contributions of computer science to 
quantum information is the beautifully simple question 
(actually anticipated by Maudlin H^): what is the cost of 
simulating quantum correlations? More precisely, Gillcs 
Brassard, Richard Cleve and their student Alain Tapp 
(5í|, and independently Michael Steiner asked the 
question: How many bits must Alice and Bob exchange 
in order to simulate (projective) measurement outcomes 
performed on quantum systems? The question concerns 
the communication during the measurement simulation, 
clearly there must have been prior agreemcnt on a com- 
mon stratcgy. If the systems are in a separable state, no 
communication at all is nccdcd. On the contrary, if the 
state allows measurements that violate a Bell inequality, 
i.e. if the state has quantum nonlocality, then it is im- 
possible to simulate it without some communication or 
some other nonlocal resources. 

For the simplest case of two 2-level systems (2 qubits) , 
this game assumes that Alice and Bob receive as input 
any possible observable, i.e. any vector a and b of the 
Poincaré sphere. And they should output one bit, cor- 
responding to the binary measurement outcome "up" or 
"down" in the physicist's spin ^ language. A simple way 
to simulate the quantum measurements is that Alice com- 
municates her input a to Bob and outputs a predeter- 
mined bit (predetermined by Alice and Bob's common 
strategy). But communicating a vector corresponds to 
infinitcly many bits! My first intuition was that there 
is no way to do any better, after all the input space is a 
continuum, quite the contrary to the case of Bell inequal- 
ities where the input space is finite, usually even limitcd 
to a binary choice. Yet, Brassard and co-workers came 
out with a model using only 8 bits of communication! 
What a surprise: is entanglement that cheap? But this 
was only a start. Steiner published a model vàlid only for 
vectors lying on the equator of the sphere, but this model 
was easy to generalize to the entire sphere [6ÏJ: it uses 
only 2 bits! 2 bits, like in dense coding and teleporta- 
tion: that should be the end, I thought! But, yet again, 
I was wrong. Bacon and Toner produced a model using 
one single bit of communication |62|. Well, by now we 
should be at the limit, isn't it? But actually, not quite! 

Let's come back to the real central question: How does 
Nature manage to produce random data at space-like 
separated locations that can't be explained by common 
causes? The idea that Nature might be exploiting some 
hidden communication (hidden to us, humans) is inter- 
esting. With my group at Geneva University we spent 
quite some time trying to explore this idea, both exper- 
imentally and theoretically. We could set experimental 
bounds of the speed of this hypothetical hidden commu- 
nication [23. We also investigated the idea that each 
observer sends out hidden information about his result 
at arbitrary large speeds as defined in its own inertial 
reference frame 23] . The measured bounds on the speed 
of the hypothetical hidden communication were very high 
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and the latter assumption contradicted by experiments. 
Also our theoretical investigation cast serious doubts on 
the existence of hidden communication. Analyzing sce- 
narios involving 3 parties we could prové that if all quan- 
tum correlations would be due to hidden communication, 
then one should be able to signal (i.e. the hidden commu- 
nication do not remain hidden) |63l l64| ! Hence, the only 
remaining alternative is that Nature exploits both hid- 
den communication and hidden variables: each one sep- 
arately contradicts quantum theory, but both together 
could explain quantum physics. However, this seems 
quite an artificial construction. Hence, let's face the sit- 
uation: Nature is able to produce nonlocal data without 
any sort of communication. But is she doing so using 
all the quantum physics artillery? Aren't there logical 
building blocks of nonlocality? A partial answer follows. 

Let us come back to the problem of simulating quan- 
tum measurements, but instead of a few bits of commu- 
nication let us give Alice and Bob a weaker resource: one 
instance of the nonlocal machine a + b = xy. That this 
is indeed a weaker resource follows from the observation 
that the correlation a + b = xy can't be used to commu- 
nicate any bit, but that by sending a single bit one can 
easily simulate the nonlocal correlation (since Alice's in- 
put is only a bit x, it sufhces that she communicates it to 
Bob). The nice surprise is that this elementary resource 
is sufficient to simulate any pair of projective measure- 
ments on any maximally entangled state of two qubits! 
For the proof the reader is referred to the original arti- 
cle |H3 and to the beautiful account in where the 
relations between all these models are presented. 

The above results are very encouraging. One can gets 
the feeling that, at last, one can start understanding non- 
locality without the Hilbert space machinery, that, at 
last, one can study quantum physics from the outside, 
i.e. from the perspective of future physical theories (as- 
suming these will keep Einstein's no-signaling constraint) 
and no longer from the perspective of the old classical me- 
chanical physics. But there is still a lot to be done! For 
instance, it is surprising (and annoying in my opinion) 
that one is still unable to simulate measurement on par- 
tially entangled states using the nonlocal correlation (ac- 
tually we could prové that this is impossible with a single 
instance of the correlation, but there is hope that one can 
simulate partially entangled qubit pairs with 2 instances 
[67|h Let me emphasize that all of today's known simu- 
lation models for partially entangled qubits include some 
sort of communication 13 62] , let's say from Alice to Bob. 
Consequently, in all these models Bob can't output his 
results before Alice was given her input. This contrasts 
with the situation in quantum measurements where Bob 
doesn't need to wait for Alice (he does not even need to 
know about the existence of Alice) and with the simu- 



lation model for maximally entangled qubits using the 
PR-box. It would be astonishing if partially entangled 
state could not be simulated in a time-symmetric way 

XI. CONCLUSION 

The history of non-locality in physics is fascinating. 
It goes back to Newton (section [l·l] It first acceler- 
ated around 1935 with Einstein's EPR and Schròdinger 
cat's papers. Next, it slowly evolved, with the works of 
John Bell, John Clauser and Alain Aspect among many 
others, from a mere philosophical debaté to an experi- 
mental physics question, or even to experimental meta- 
physics as Abner Shimony nicely put it [70j . Now, during 
the last decade, it has run at full speed. Conceptually 
the two major breakthroughs were, first Artur Ekert's 
1991 PRL which strongly suggests a deep link between 
non-locality and cryptography, section IX Cl The second 
breakthrough, in my opinion, is the PR-box, section lX Al 
the understanding that non-signaling correlations can be 
analyzed for themselves, without the need of the usual 
Hilbert space artillery, thus providing a simple concep- 
tual tool for the unravelling of quantum non-locality. We 
have reviewed that the no-cloning thcorem, the uncer- 
tainty relation, the monogamy of extreme correlation and 
the security of key distribution, all properties usually as- 
sociated to quantum physics are actually properties of 
any theory without signaling, section Q In particular 
we emphasized that the second breakthrough, the PR- 
box, allows one to confirm the first breakthrough: there 
is an intimate connection between violation of a Bell in- 
equality and security of quantum cryptography. 

And relativity, can it be considered complete? Well, 
if nonlocality is really real, as widely supported by the 
accounts summaries in this article, then all complete the- 
ories should have a place for it. Hence, the question is: 
"Does relativity hold a place for non-signaling nonlocal 
correlations?" . 
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Usin g th e reduction of an OT-box (Oblivious Transfer to a PR- 
box) 68] one can simulate any 2-qubit state with one OT-box. 
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